Language modeling for content extraction in human-computer dialogues

نویسندگان

  • Wolfgang Reichl
  • Bob Carpenter
  • Jennifer Chu-Carroll
  • Wu Chou
چکیده

In this paper we discuss the role of language modeling in a novel natural language dialogue system designed to automatically route incoming customer calls. We arrive at two significant conclusions: First, standard word error rate measures do not reflect application specific requirements; highly reliable content extraction is possible with relatively high word error rates. Secondly blending human-human data with human-machine data did not improve the performance in language modeling.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Extraction using Content-Based Handles

In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...

متن کامل

Topic Identification in Natural Language Dialogues Using Neural Networks

In human–computer interaction systems using natural language, the recognition of the topic from user’s utterances is an important task. We examine two different perspectives to the problem of topic analysis needed for carrying out a successful dialogue. First, we apply selforganized document maps for modeling the broader subject of discourse based on the occurrence of content words in the dialo...

متن کامل

Translation of Anthroponyms in Children’s Cartoons: A Comparative Analysis of English Dialogues and Persian Subtitles

  The impact of animated cartoons on children has already been emphasized by quite many researchers. The present study aimed to investigate the strategies Iranian subtitlers of English animated cartoons used in re n- dering English anthroponyms in cartoons. To this aim, two theoretical frameworks were employed: Van Coillie's Model of Translating Proper Names and Fernandes’s Model of Proper N...

متن کامل

A Supervised Method for Constructing Sentiment Lexicon in Persian Language

Due to the increasing growth of digital content on the internet and social media, sentiment analysis problem is one of the emerging fields. This problem deals with information extraction and knowledge discovery from textual data using natural language processing has attracted the attention of many researchers. Construction of sentiment lexicon as a valuable language resource is a one of the imp...

متن کامل

Automating the Extraction of User Model Information from Consultation Dialogues Automating the Extraction of User Model Information from Consultation Dialogues

This thesis addresses a natural language processing problem posed in the context of so-called Web assistant systems aka live help systems. A recent feature added to a growing number of Web sites, such systems o er user support via text chat with human assistants. To adapt consultation to the individual user, long-term information about his or her skills and interests is collected in a user mode...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998